median: use nth_element for linear-time selection on dense inputs #40
Conversation
Just to be in sync with the issue, please refer to the community page too!
mmuetzel
left a comment
Thank you for your contribution.
I only had a very cursory look at the patch and didn't run any tests yet.
For now, only a minor style request, and one comment that needs clarification.
I wrote a central helper function, and I changed the comments as per your suggestion too. Passes all TCs.
Why did you remove the BISTs that you added previously? Are they no longer passing?
scripts/statistics/median.m
Outdated
function m = mid_two_vals (m1, m2, is_int)
  if (is_int)
    samesign = sign (m1) == sign (m2);
    m = samesign .* (m1 + (m2 - m1) / 2) + ! samesign .* ((m1 + m2) / 2);
I haven't timed that, but there would probably be a couple fewer arithmetic instructions for something like this:
m(samesign) = (m1(samesign) + (m2(samesign) - m1(samesign)) / 2);
m(! samesign) = ((m1(! samesign) + m2(! samesign)) / 2);
But maybe that is outweighed by the indexing operations. Still, it might be worth testing, given that the motivation of this change is performance.
@mmuetzel They were passing, but since those BISTs were effectively testing impossible inputs rather than real behaviour, I dropped them.
Let me think of a way of benchmarking the hot paths with high precision; in my experience, at very fine granularity the benchmarking methodology becomes important (warmup steps, blocking secondary processes, testing on multiple different architectures, etc.). I will add the BISTs back along with this commit, plus results (there is no such thing as too many BISTs!).
If we had those BISTs, you would have caught earlier that something was wrong with the implementation that you had intermediately. So, having exactly those kinds of tests is useful. Could you please add them again?
done!
b992a65 to bc2087d (force-pushed)
bc2087d to 7d220c1 (force-pushed)
@mmuetzel I ran a simple microbenchmark comparing the current mask-multiplication form with a masked-indexing variant. On my machine (N = 1e7), the mask-multiplication version was consistently faster (~2x). This is likely because it avoids gather/scatter and keeps contiguous memory access, whereas the masked-indexing version introduces extra memory traffic.
@mmuetzel Can you please review it? I also ran a microbenchmark to compare both approaches.

As pointed out in the community discussion, `median (x)` has a hidden cost; this change implements a possible fix.
Summary
This change replaces the full sort used in `median ()` for dense inputs with order-statistic selection via `nth_element`, reducing average complexity from O(n log n) to O(n). Sparse inputs retain the original sort-based code path to preserve historical behavior and output sparsity.

Algorithmic change
- Previous implementation: `sort (x)` followed by indexing the middle element(s)
- New implementation: `nth_element (x, k)` for odd length, `nth_element (x, [k, k+1])` for even length

This computes only the required order statistics without fully sorting.
NaN handling
NaN semantics are preserved explicitly:
- `"omitnan"`: NaNs are filtered before selection.
- `"includenan"`: slices containing NaNs return NaN.

This matches the behavior previously provided implicitly by `sort ()`.

Sparse behavior
Sparse inputs continue to use the original sort-based code path to preserve historical behavior and output sparsity. Dense inputs use the new `nth_element` path.

I went a step further and ran a benchmark to see if it really delivers the promised speedup; the results are below. I am also attaching the benchmarking script.
Performance
Benchmarks on large dense inputs (Octave 9.x, Apple clang):
Speedup grows with input size, consistent with removal of the log n factor.

The benchmarking script can be found here.
The CSV results from which the plots were generated can be found here.
Notes